Corpus: cat_wikipedia_2021_300K

3.13.1 Average Position of Words by Word Length

Average position of words in sentences as a function of word length

word_length count avg(pos) stddev(pos) avg(sentence_length)
1 396596 14.0995 9.3654 23.9885
2 1253163 11.3342 9.4306 23.6024
3 895581 11.7562 9.2882 23.6280
4 495128 12.3202 9.7184 23.6067
5 486799 12.2188 9.4995 23.4328
6 482625 12.1081 9.7343 23.4537
7 423069 12.0470 9.3191 23.3547
8 380468 12.2830 9.2832 23.1265
9 310263 12.1639 9.2887 22.9844
10 197271 12.3282 9.2697 23.2361
11 128986 12.4821 9.2165 23.3071
12 93214 12.4379 9.1676 23.2590
13 47557 13.3883 9.7802 23.2424
14 23724 12.6076 9.3768 23.1216
15 13338 12.4127 9.2306 23.1157
16 5772 12.6161 9.3268 22.9993
17 3327 12.1094 9.1311 22.9690
18 1399 12.4753 9.4386 23.2244
19 890 12.0371 9.5624 22.4573
20 816 11.5368 9.1000 23.4485
21 854 11.9930 9.3780 22.5902
22 731 11.9357 8.9498 22.5212
23 473 12.0592 9.0472 22.2558
24 564 12.5195 9.1722 22.2411
25 208 11.1106 8.9680 22.6154
26 144 13.7986 9.7187 24.4792
27 145 9.1310 7.6241 20.0690
28 63 10.6349 9.7952 21.9048
29 83 11.6024 9.0938 24.0602
30 89 10.7978 8.8383 23.3596
31 59 11.1356 7.5811 22.3898
32 56 10.7321 8.6282 20.8393
33 45 12.5111 9.3752 22.1778
34 53 12.2830 9.4856 20.3774
35 23 5.9565 6.7982 14.7391
36 21 11.5238 12.3119 20.8095
37 25 12.0000 7.8791 24.6000
38 17 9.5882 9.3624 19.2353
39 13 9.6154 9.6280 19.1538
40 9 9.8889 9.1826 26.2222
41 4 15.2500 8.8987 37.5000
42 17 8.2941 6.5867 23.3529
43 6 7.0000 3.9581 14.6667
44 9 6.0000 4.3716 25.5556
45 2 2.5000 2.5000 8.5000
46 4 10.5000 5.5000 16.7500
47 2 5.0000 0.0000 20.0000
48 4 9.0000 5.9161 22.2500
50 4 5.7500 2.8614 24.7500
51 2 0.0000 0.0000 9.0000
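
The table above can be reproduced in outline with the following minimal sketch. It is not the code that generated this page; the function name position_stats_by_word_length is hypothetical, and several points are assumptions: sentences are tokenized on whitespace, word length is the number of characters in a token, positions are counted from 0, and sentence length is measured in tokens. The population standard deviation is used because it is consistent with rows such as word length 45 (count 2, average 2.5, standard deviation 2.5), which a sample standard deviation would not yield.

```python
from collections import defaultdict
from math import sqrt


def position_stats_by_word_length(sentences):
    """Return {word_length: (count, avg_pos, std_pos, avg_sentence_length)}."""
    # Running totals per word length:
    # [count, sum of positions, sum of squared positions, sum of sentence lengths]
    acc = defaultdict(lambda: [0, 0.0, 0.0, 0.0])
    for sentence in sentences:
        tokens = sentence.split()  # assumption: whitespace tokenization
        for pos, token in enumerate(tokens):  # assumption: 0-based positions
            totals = acc[len(token)]  # assumption: length = character count
            totals[0] += 1
            totals[1] += pos
            totals[2] += pos * pos
            totals[3] += len(tokens)  # assumption: sentence length in tokens

    stats = {}
    for length, (n, pos_sum, pos_sq_sum, sent_len_sum) in sorted(acc.items()):
        mean = pos_sum / n
        # Population variance; clamp tiny negative values from rounding.
        variance = max(pos_sq_sum / n - mean * mean, 0.0)
        stats[length] = (n, mean, sqrt(variance), sent_len_sum / n)
    return stats


if __name__ == "__main__":
    # Toy input, not taken from the corpus.
    demo = ["a bb a", "bb ccc"]
    for length, (n, avg, std, sent_len) in position_stats_by_word_length(demo).items():
        print(f"{length} {n} {avg:.4f} {std:.4f} {sent_len:.4f}")
```

Each output row has the same column order as the table: word length, count, average position, standard deviation of position, and average length of the sentences in which words of that length occur.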


[Figure: Gnuplot diagram of average word position as a function of word length]

62973 msec needed at 2024-09-23 02:01